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ABSTRACT: Multipliers are vital components of any processor or computing machine. More often 
than not, performance of microcontrollers and Digital signal processors are evaluated on the basis of 
number of multiplications performed in unit time. Hence better multiplier architectures are bound to 
increase the efficiency of the system. Vedic multiplier is one such promising solution. Its simple 
architecture coupled with increased speed forms an unparalleled combination for serving any complex 
multiplication computations. Tagged with these highlights, implementing this with reversible logic 
further reduces power dissipation. Power dissipation is another important constraint in an embedded 
system which cannot be neglected. In this paper we bring out a Vedic multiplier known as " Nikhilam 
Sutra multiplier". The — Nikhilam Navatascaram Dasatah literally means — All from Nine and the 
last from Ten. The sutra basically means start from the left most digit and begin subtracting = 9' from 
each of the digits; but subtract = 10' from the last digit.This will be implemented using reversible logic, 
which is the first of its kind. This multiplier may find applications in Fast Fourier Transforms (FFTs), 
and other applications of DSP like imaging, software defined radios, wireless communications. 

Index Terms: Reversible Logic, Nikhilam Sutra, vedic multiplier, Quantum cost, Total reversible logic 
implementation cost. 



I. INTRODUCTION 

Vedic mathematics is the ancient Indian system of mathematics which mainly deals with Vedic 
mathematical formulae and their application to various branches of mathematics. Vedic mathematics was 
reconstructed from the ancient Indian scriptures (Vedas) by Sri Bharati Krishna Tirtha after his research on 
Vedas . He constructed 16 sutras and 16 upa sutras after extensive research in Atharva Veda. The most famous 
among these 16 are Nikhilam Sutram,Urdhva Tiryakbhayam, and Anurupye. It has been found that Nikhilam is 
the most efficient among these. The beauty of Vedic mathematics lies in the fact that it reduces otherwise 
cumbersome looking calculations in conventional mathematics to very simple ones. This is so because the Vedic 
formulae are claimed to be based on the natural principles on which the human mind works. Hence 
multiplications in DSP blocks can be performed at faster rate. This is a very interesting field and presents some 
effective algorithms which can be applied to various branches of engineering. Digital signal processing (DSP) is 
the technology that is omnipresent in almost every engineering discipline. Faster additions and multiplications 
are the order of the day. Multiplication is the most basic and frequently used operations in a CPU. Multiplication 
is an operation of scaling one number by another. Multiplication operations also form the basis for other 
complex operations such as convolution, Discrete Fourier Transform, Fast Fourier Trans forms , etc. With ever 
increasing need for faster clock frequency it becomes imperative to have faster arithmetic unit. Hence Vedic 
mathematics can be aptly employed here to perform multiplication. 

Reversible logic is one of the promising fields for future low power design technologies. Since one of 
the requirements of all DSP processors and other hand held devices is to minimize power dissipation multipliers 
with high speed and lower dissipations are critical. This paper proposes an implementation of Reversible 
Nikhilam Multiplier which consists of two cardinal features. One is the fast multiplication feature derived from 
Vedic algorithm Nikhilam sutra and another is the reduced heat dissipation by the virtue of implementing the 
circuit using reversible logic gates. The paper is partitioned into Six sections. Section (ii) gives literature survey, 
Section (iii)deals with reversible logic. Section (IV) explains the Nikhilam sutra algorithm. Section (V) 
elaborates on the design aspects of Reversible Nikhilam vedic sutra Multiplier. Section (vi) Evaluation 
Conclusions and references follow. 
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II. LITERATURE SURVEY 

Energy loss is an important consideration in digital circuit design. A part of this problem arises from 
the technological non ideality of switches and materials. The other part of the problem arises from Landauer's 
principle for which there is no solution. Landauer's Principle states that logical computations that are not 
reversible necessarily generate k*T*ln(2) joules of heat energy, where k is the Boltzmann's Constant 
k=1.38xlO-23 J/K, T is the absolute temperature at which the computation is performed. Although this 
amount of heat appears to be small, Moore's Law predicts exponential growth of heat generated due to 
information lost, which will be a noticeable amount of heat loss in next decade. Also by second law of 
thermodynamics any process that is reversible will not change its entropy. On thermo dynamical grounds, 
the erasure of one bit of information from the mechanical degrees of a system must be accompanied by the 
thermalization of an amount of k*T*ln (2) joules of energy. The information entropy H can be calculated for 
any probability distribution. Similarly the thermodynamic entropy S refers to thermodynamic probabilities 
specifically. Thus gain in entropy always means loss of information, and nothing more. Design that does not 
result in information loss is called reversible. It naturally takes care of heat generated due to information loss. 
Bennett showed that zero energy dissipation would be possible only if the network consists of reversible logic 
gates, Thus reversibility will become an essential property in future circuit design technologies. 

III. REVERSIBLE LOGIC 

Reversible logic is a promising computing design paradigm which presents a method for constructing 
computers that produce no heat dissipation. Reversible computing emerged as a result of the application of 
quantum mechanics principles towards the development of a universal computing machine. Specifically, the 
fundamentals of reversible computing are based on the relationship between entropy, heat transfer between 
molecules in a system, the probability of a quantum particle occupying a particular state at any given time, and 
the quantum electrodynamics between electrons when they are in dose proximity. The basic principle of 
reversible computing is that a bi-jective device with an identical number of input and output lines will produce a 
computing environment where the electrodynamics of the system allow for prediction of all future states based 
on known past states, and the system reaches every possible state, resulting in no heat dissipation A reversible 
logic gate is an N-input N-output logic device that provides one to one mapping between the input and the 
output. It not only helps us to determine the outputs from the inputs but also helps us to uniquely recover the 
inputs from the outputs. Garbage outputs are those which do not contribute to the reversible logic realization of 
the design. Quantum cast refers to the cost of the circuit in terms of the cost of a primitive gate. Gate count is the 
number of reversible gates used to realize the function. Gate level refers to the number of levels which are 
required to realize the given logic functions. 

The following are the important design constraints for reversible logic circuits. 

1. Reversible logic gates do not allow fan-outs. 

2. Reversible logic circuits should have minimum quantum cost. 

3. The design can be optimized so as to produce minimum number of garbage outputs. 

4. The reversible logic circuits must use minimum number of constant inputs. 

5. The reversible logic circuits must use a minimum logic depth or gate levels. 

The basic reversible logic gates encountered during the design are listed below: 

1 . Feynman Gate : 

It is a 2x2 gate and its logic circuit is as shown in the figure. It is also known as Controlled Not 
(CNOT) Gate. It has quantum cost 1 and is generally used for Fan Out purposes. 

2. Peres Gate : 

It is a 3x3 gate and its logic circuit is as shown in the figure. It has quantum cost 4. It is used to realize 
various Boolean functions such as AND, XOR. 

3. Fred kin Gate : 

It is a 3x3 gate and its logic circuit is as shown in the figure. It has quantum cost 5. It can be used to 
implement a Multiplexer. 

4. HNG Gate : 

It is a 4x4 gate and its logic circuit is as shown in the figure. It has quantum cost 6. It is used for 
designing ripple carry adders. It can produce both sum and carry in a single gate thus minimizing the garbage 
and gate counts. 
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5.Toffoli Gate : The 3*3 Reversible gate with 3 inputs and 3 outputs. It has Quantum cost 5 




Figure 1 . Toffoli gate 
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Figure 2. 
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Figure 3. Peres gate 
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Figure 4. Fredkin gate 
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Figure 5. HNG gate 

IV. NIKHILAM SUTRA MULTIPLICATION ALGORITHM 

The — Nikhilam Navatascaram Dasatah literally means — All from Nine and the last from Ten. The 
sutra basically means start from the left most digit and begin subtracting = 9' from each of the digits; but subtract 
_10' from the last digit.The following example illustrates the way in which this Sutra could reduce the number 
of iterations to reduce the whole Multiplication. 

To multiply 92 and 89. Apply Nikhilam Sutra All from nine and last from ten on both the numbers — 
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Nikhilam 

89 : -1 1 

92 ^^^^ -8 

Figure 1 

The arrows in Figure 2 indicate the operation of the Nikhilam 
Sutra being performed, viz. the subtraction of 10 from the last 
digit and 9's from all the other digits starting with the leftmost 
digit. 

□ Now we write this down side-by-side, 

92 -08 
89 -11 



□ Multiply (-08) and (-1 1) to get = 88'. 



-OS 
-1 1 



□ Now we cross-add. This is done by both — adding 92 
and -1 1 to get 8 lor — adding 89 and -08 to get 81. 




-OS 
-11 



□ Note that in both operations you get the same answer 
that is = 8 r which is written below to get the solution. 



-OS 

-1 1 



81 



This technique works very well if the numbers to be multiplied are near a base. Upon little alteration, 
this also works very well for any pair of numbers. 

Case (i): Both the numbers are lower than the base. 

EXP.l 986*989. Base is 1000 

986 14 
989 11 



986-11 14*11 
989-14 



975 154 
So 986*989=975154 

Case (ii): Both the numbers are higher than the base. 

EXP.2 104*102. Base is 100 

104 04 
102 02 



104+2 04*02 
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102+4 

106 08 
So 104*102=10608 

Case (iii): One number is more and other is less than the base. 

EXP.3 998*1025.Base is 1000 

998 002 
1025 025 



998-25 
1025+2 



2*25 



1023 50 ( since the complement of 50 is 950 for the base 1000) 

We get 1023 950 But we have to subtract 1 from 1023 
So 998*1025= 1022950 

After this illustration, we now discuss the operational principle of Nikhilam Sutra by taking the case of 
multiplication of two 

n-bit numbers x and y having complements x = lOn - x and y = lOn - y respectively. 
The required product jp' is defined as: p = xy ... (1) 

which can be reframed by adding and subtracting 102n +10n(x + y) to the right hand side as: 
p = xy + 102n - 102n + lOn x + y - (x +y) ... (2) 

The above terms can be clubbed as follows: 

p = lOn x + y - 102n+ 102n - lOn x + y + xy= lOn x + y - lOn + lOn - x lOn - y 
= lOn x-y + xy = lOn y - x + {x y }... (3) 

From (3), the expressions of LHS and RHS can be deduced, 
which come out to be: 
LHS = x y={y } ... (4) 
RHS ={}... (5) 

Hence the multiplication of two n- bit numbers is reduced to the multiplication of their complements. To take 
full advantage of this reduction, it should be ensured that the 

numbers obtained after taking the complements are lesser than the original numbers. 

This condition is satisfied if both the original numbers are greater than 10n=2, i.e., x > 10n=2 and y > 10n=2. 

This is the reason why it is said that the Nikhilam Sutra is 

efficacious in the multiplication of large numbers than the smaller ones. 

V. DETAILED DESIGN (PROPOSED ARCHITECTURE) 
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Figure 2 
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5.1. Top Module: 

The block diagram for proposed multiplier is shown in the figure2. As we are using binary numbers in 
digital signal processing applications we have implemented for binary system. The multiplication can be done 
using the complementer, CSA adder and adder. The RHS of the product can be obtained by the multiplication of 
complimented outputs of multiplier and multiplicand and the LHS of the product can be obtained by addition 
using a CSA. This can be used for the multiplication of any number of bits. In this paper, we have presented a 
4x4 architecture applying the Nikhilam Algorithm. In the figure 2, the two inputs a and b represents the 4 bit 
multiplier and 4 bit multiplicand respectively. 



5.2. Internal blocks: 

5.2.1 2's complementer: 

The multiplicand and the multiplier are given as inputs to the two 2's complementer blocks. The logic 
implementation of the 4 bit 2's complementer is presented in figure3 




c{4) c{3) 



In figure3, the — HA represents a Half Adder block. 



Figure3 



5.2.2 Multiplier: 

Now, the complemented output of multiplier (-a) and the complemented multiplicand (-b) are then 
produced. These complemented outputs of multiplier and multiplicand are given as inputs to the multiplier 
block. The 4x4 multiplier architecture that we employed is based on calling a 2x2 multiplier so as to ease the 
multiplication procedure. This implementation is represented by the 

following figure 4. Here, a and b are the 2-bit (or the 4-bit) multiplier and multiplicand respectively which are 
being multiplied to produce the final 4-bit(or the 8-bit) product vector.The half of the LSB bits of multiplication 
output is taken as RHS product of the total multiplication. 
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5.2.3 CSA: 

The carry-save unit consists of n full adders, as shown inFigure5 each of which computes a single sum 
and carry bitbased solely on the corresponding bits of the three input numbers. 
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n=4 in the evaluated multiplier of this paper 

Figure 5 

With three n - bit numbers ai, bi, and ci given to it, it produces a partial sum psi and a shift-carry sci 
according to the below equations :- 
Psi=ai □ bi □ ci 

Sci=(ai □ bi) V (ai □ ci) V (bi □ ci) 

These psi and sci are then added using a conventional adder, to produce the sum of the three inputs. 
The multiplier, multiplicand and the half of MSB bits are given as inputs to the CSA. The two outputs i.e., sum 
vector and carry vector obtained from the CSA adder are given to the 

inputs for the adder block. The output he we obtain is labeled the LHS of the required multiplication product. As 
we are using CSA, the delay will be reduced and the number of components gets reduced for addition 
mechanism. This is the main advantage of the multiplier based on Nikhilam over the multiplier based on 
conventional algorithms 

proposed in the a fore mentioned algorithms in = P. 



VI. EVALUATION 

The design of the reversible 2,4,8,16 bit multiplier is logically verified using XILINX 8. li and 
MODELSIM SE 6.5. Synthesis and Implementation is done using Xilink Spartan-3E Fpga Board of device 
family xc3s500e-4fg320.The simulation results are as shown in figures . 

Here we introduce a new function called the "Total Reversible Logic Implementation Cost (TRL1C)" 
which is defined as the sum of all the cost metrics of a given reversible circuit. The TRLIC can be deemed as a 
parameter which reflects the overall performance of a reversible logic circuit. TRLIC=X(NG,CI,QC,GO) 

Where NG is the number of gates in the reversible circuit.CI is the number of constant inputs, QC is the 
quantum cost of the circuit. GO is the number of garbage outputs. The following are the important design 
constraints for any reversible logic circuits. 

1. Reversible logic circuits should have minimum quantum cost. 

2. The design can be optimized so as to produce minimum number of garbage outputs. 

3. The reversible logic circuits must use minimum number of constant inputs. 

4. The reversible logic circuits must use a minimum number of reversible gates. 

6.1. Simulation Methodology 

Xilinx 8. li has been used to simulate the wave forms. The simulator carefully modeled the 
interconnections, the associated blocks and the propagation delays. 




RTL Schematic 
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6.2.Results 

In this section we show results for the Vedic multiplier based on Nikhilam Sutra . Multiplier based on 
Nikhilam Algorithm utilizes smaller area and produces littler delay than the conventional multiplier. 



Simulation Results for 2 bit Multiplication 
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Simulation Results for 4 bit Multiplication 
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Simulation Results for 8 bit Multiplication 
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Simulation Results for 16 bit Multiplication 
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CONCLUSION 

The proposed Vedic multiplier architecture exhibits speed improvements. The Vedic multiplier 
employing Nikhilam Sutra found to be better than conventional multiplier in terms of speed when magnitude of 
both operands are more than half of their maximum values . This approach may be well suited for multiplication 
of numbers with more than 16 bit size. 
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